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(1) GENERAL INFORMATION: 

(i) APPLICANT: Fiine , Jasper D. 

Bpyartchuk, Victor L. 
Afehby, Matthew N. 



(ii) TITLE OF II 



riON: AFC1 AND RCE1 I SOPRENYLATED CAAX 
PROCESSING ENZYMES 



(iii) NUMBER OF SEQUENCES : 4 

( iv) CORRESPONDENCE! ADDRESS : 

(A) ADDRESSEE! Pennie & Edmonds LLP 

(B) STREET: 13155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New\York 

(E) COUNTRY : US 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE A Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Pattentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Mandel , ^raLynn 

(B) REGISTRATION NUMBER: 31,853 

(C) REFERENCE/DOCKET NUMBER: 9067-008-999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) \493 -4935 

(B) TELEFAX: (415) 433-5556 

(C) TELEX: 6 6141 PENNfEE 




(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1825 base Aairs 

(B) TYPE: nucleic acid' 

(C) STRANDEDNESS : unknc 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ1 ID NO : 1 : 

ACCTACCTTT TTTTCTATCT TCAACAACGA AACGCCTTAC ACACACACAC ACATACATCT 

ACATACATAC ATACAAATAT ACATATATGT AflACTTGTAT ATTCATTCCT ATTAACCAAA 

AAGAGGCAAT TAAACTTTTC CCTCTTTTTC TpJcGTCATTT ACTCAAAAAC TCTAATfCCT 

TCGTCTCTGT TCTGCCATTT TCTCCAGAAA AAfcATCGACG GGAAATAAAA AAAAAAAGAC 

AACGAACAAG AGAAAAAGTT CGCGAATTAT AA^CCACTTC TATAATTAAC AGGAAAAGGA 
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240 
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AGGAAAAAAA AGGAGGAAAA GAAAACTGCA GGCCTTTATT CATGTTTGAT CTTAAGACGA 360 

TTCTCGACCA TCCTAATATSC CCGTGGAAAT TAATCATTTC TGGGTTCTCG ATTGCCCAAT 42 0 

TTTCTTTCGA ATCTTACTTG ACGTACAGAC AGTACCAGAA GCTATCTGAA ACAAAGTTGC 4 80 

CACCTSTGCT GGAAGACGAa\ ATTGATGATG AAACTTTTCA TAAATCAAGG AACTACTCCC 54 0 

GGGCCAAGGC CAAGTTCTCC ATTTTCGGTG ACGTCTATAA CCTAGCCCAA AAGCTAGTTT 6 00 

TCATCAAATA CGACCTCTTC OCTAAAATCT GGCACATGGC CGTTTCTTTA TTGAATGCAG 660 

TCCTGCCAGT CAGATTTCAT ATGGTCTCCA CTGTCGCACA GAGTTTTATG CTTCTTGGGT 72 0 

CTCTTATCCA GTTTGTCTAC Ct4gGTTGAT TTGCCACTCT CTTACTATAG CCATTTTGTC 780 

CTGGAAGAAA AATTTGGTTT CAATAAATTG ACCGTCCAAC TATGGATCAC CGATATGATC 84 0 

AAGAGTCTGA CTTTGGCGTA TGCmTTGGT GGCCCAATCC TTTACCTGTT CCTTAAGATC 900 

TTTGATAAAT TCCCTACTGA TTTCOTTTGG TACATTATGG TCTTCTTGTT CGTTGTCCAA 960 

ATCTTAGCCA TGACAATCAT TCCAGTECTTC ATCATGCCCA TGTTTAATAA GTTCACTCCA 1020 

TTGGAGGACG GTGAACTGAA AAAATCTATT GAAAGTTTGG CCGATAGAGT TGGGTTCCCT 1080 

CTAGATAAGA TTTTTGTCAT TGACGGCTCA AAAAGATCTT CTCATTCAAA CGCATATTTC 114 0 

ACAGGTTTGC CATTCACCTC CAAGAGAMTT GTTTTGTTCG ACACTTTAGT GAACAGTAAT 12 00 

TCTACTGATG AAATTACGGC TGTTTTGGGC CATGAAATCG GTCACTGGCA AAAAAACCAC 12 60 

ATCGTTAATA TGGTCATCTT TAGTCAATTG CACACCTTCC TCATTTTCTC CCTTTTCACC 1320 

AGCATCTACA GAAATACATC ATTTTACAAQ ACCTTCGGCT TTTTCTTAGA GAAGT C C ACT 13 80 

GGCAGTTTTG TTGATCCCGT TATCACTAAG IGAATTCCCCA TTATCATTGG ATTTATGTTA 144 0 

TTTAACGACT TATTAACTCC ACTCGAATGT GCCATGCAAT TCGTGATGAG TTTAATTTCC 150 0 

AGAACTCATG AATATCAAGC TGATGCTTAT GfcTAAAAAAT TGGGCTACAA GCAAAATCTA 156 0 

TGTAGGGCTC TAATTGATCT ACAAATCAAA AACCTTTCCA CCATGAATGT AGATCCTCTG 162 0 

TATTCTAGCT ATCATTATTC CCATCCAACT CTAGCTGAAA GATCGAC CGC TCTAGACTAT 168 0 

GTTAGTGAAA AGAAGAAAAA CTAATCTATA GAGTACACAT ATTAGCATGT ACCGTTAAAT 174 0 

TCAGCTTCGT TATGTCTATA TCTACATACA TACJ^CAGGTA TCTACTATAA GAATAAAGGA 18 0 0 

AAGAAAAAAT AAACGATTAA ACATT \ 182 5 

2) INFORMATION FOR SEQ ID NO : 2 : \ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 2850 base pairs \ 

(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS : unknown \ 

(D) TOPOLOGY: unknown \ 

(ii) MOLECULE TYPE: DNA (genomic) \ 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 \ 



TGAACTGTTG ATGAACAAAG AGAAGCTGAC AAGCATC 



GCTTTGTACG ATGATTTCCA 



60 



TTCAAAAATT TGTGAAITATG AAACCAAGTT CAACAAGAAT TTTCTTGAAT TAAATGAGTT 120 

ATATAATATG AATAGGGGAG ACCGTAGGCC AAAGGAACTG AAATTTACAG ATTTTATTAC 18 0 

TTCACAGCTG TTTAACGATA TCGAAAGCAT TTGCAACTTG AAAGTTAGTG TTCACAACTT 24 0 

ATCCAACATT TTTAAAAAAC AGGTCAGTAC CCTAAAACAA CACTCAAAGC ACGCATTATC 3 00 

TGAGGATTCA ATATCGCAQA CAGGTAACGG TAGTTCATCG TCGCCCAGTT CAGCGTCATT 360 

AACGCCAGTA ACT TCTTCATk CC AAGAGTAG TTTATTTTTA CCTAGCGGTA GCTCGTCTAC 42 0 

TTCCCTGAAA TTTACAGACC y^GATTGTTCA TAAATGGGTT AGGATTGCTC CTTTACAGTA 4 80 

CAAACGAGAC ATTAATGTGA ACTTGGAATT TAATAAGGAC ATTAAGGAAA CTTTAATTCC 54 0 

AAGTTTTGAA AGCTGCCTAT G'KTGTAGGTT TTATTGCGTT CGAGTAATGA TTAAATTTGA 600 

AAAC CATCTT GGCGTAGCGA AGJSCTTGATAT CCCTATTTCT GTTAGGCAAG TGACAAAATA 660 

AAAAAACATT AGAAAAAATT CTCOTTACTT TTCTTATAGA TATAGATATA TGTATGGTTT 72 0 

GCTTATAGAT GAAGGTATTT ATCGOBTCCT TTGTATTCCC TATTATTAAT AAAATTCTTT 780 

TAAAATGCAT TTTCTGGTGC TCTTTTOTTG CTTCTGTATT TTTTTTTTTT TGGACCACTG 840 

GATGGAAAAC CTTTGATGAT TTTATTACCT TTATTTTAAC TTACTAAAAT ATCGAGATTT 900 

CAGGAACAAA ACATAGAATT TTCTTTGTCA AGAAAAATAA AACGAAATAA ATTGATGCTT 960 

TGACTACTGA CTGTCTGTCA TAGAGAGAM CAGAACAGCA ATGCTACAAT TCTCAACATT 102 0 

TCTAGTGCTC CTATACATCT CCATATCCTaX TGTGCTACCG CTATATGCAA CTTCACAACC 1080 

AGAAGGGTCT AAACGAGATA ATCCTCGAAC GATTAAATCT CGCATGCAAA AACTTACAAT 114 0 

TATGCTAATT TCCAACCTTT TTTTGGTGCC TTTTTTACAA TCTCAATTAT CTAGTACCAC 12 00 

TTCACATATA AGTTTCAAGG ACGCATTTTT ACTJCTTAGGT ATTATCCCAG GTTATTACGC 1260 

TGCATTGCCA AACCCTTGGC AATTCAGCCA GTTfcGTGAAA GACTTAACGA AATGTGTTGC 1320 

GATGTTATTG ACCTTATATT GTGGACCCGT TTTTvGATTTT GTATTATATC ATTTATTAAA 1380 

TCCAAAGAGC TCTATACTTG AAGATTTTTA CCATQAATTC CTGAATATTT GGAGTTTCAG 1440 

GAATTTTATA TTTGCACCAA TAACTGAGGA AATATTCTTAC ACGTCAATGC TTTTGACTAC 15 00 

GTACTTAAAC CTAATACCGC ATTCGCAACT AAGCTATCAA CAGTTATTTT GGCAACCATC 1560 

GCTTTTTTTT GGACTTGCGC ACGCACACCA TGCTTATQAG CAATTACAGG AAGGCTCCAT 162 0 

GACAACTGTT TCCATTCTGC TGACAACATG CTTCCAAATtT TTATACACAA CACTTTTTGG 168 0 

AGGGTTAACC AAGTTTGTAT TGCAATATCA TGGGGTTTAffl TGGTCCTTCA AGATTGAATT 174 0 

ACATTTCAC AGTAGTAGAC AAGAAAGCTG GACGCATTTC VcAAATTGGTC TCTCAATCTG 18 00 

GAATAAGTGC TACTTCGCAC TGCTGGTCCT TGGATTAATA TCCCTGAAGG ATACCTTACA 1860 

AACTCTGGTA GGAACTCCTG GTTATAGAAT AACCCTTTAG CCTTTTTTAC GTACTTGTAT 192 0 

ACCGTTTAAA ATTTCCTATG TACTATAACC TTTTTTCACT ACTTATTATGG AATTCTATCG 198 0 

AGCGACCGGG CTTTTGTTAC GGAAGAGTGA AAAAATCGAG TTTETGGTGTT TTGGTGAAAG 2 04 0 

AATTTGGAGG AC T AT AAAGT ACCTATACTT TGTATTACGG ACT^AATAAC AAGTCGTTCG 2100 
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TGTCAGTGGT 


ATTGAAGTTG 


TCAGATCTAA 


GAGTAGAGAG 


AAGGTGGCAT 


CTAATAGGTT 


2160 


TCGACGTTTT 


TCTTTTTTTA 


AqGTTTTTAT 


TTGGTCTCCT 


AGAATTTAAG 


GTCTTAGTTA 


2220 


GTTTTGGTTT 


GTTTTGTGGG 


TT AC AT AT TT 


TCAATTCAAA 


GGAGAATTTA 


GCTGTCTTTT 


2280 


AT AATG T C C A 


TV l"P 7V /*"» TV /""l TV T 1 7V TV 

ATAGAGATAA 


CG A<E AG C ATG 


CTGCGTACTA 


/~i TV rp/-1 TV TV /"> f*f* TV 

CATCAAGCGA 


m tv tv /"i /~< /"i /"i tv m/^ 

T AAGG C GAT C 


2340 


GCTAGTCAAA 


1 .'-I /— 1 7\ rp TV TV TV 

GGGATAAACG 


TV TV /^JP /irp/^ TV TV 

G AAGtl C TG AA 


GTTTTGATTG 


CTGCACAGTC 


C C TTG ACAAT 


2400 


GAAATCCGCA 


GCGTAAAAAA 


CCTAAAAAGA 


TTGTCGATTG 


GGTCAATGGA 


TTTACTTATT 


2460 


GATCCAGAAT 


TAGATATAAA 


ATTCG&TGGG 


GAATCTAGTG 


GGAGACGATC 


ATGGTCTGGC 


2520 


ACGACATCCA 


GTTCTGCGTC 


AATGCCAAGT 


G AC AC AAC C A 


CCGTTAATAA 


CACACGATAT 


2580 


AGCGATCCAA 


CTCCGCTAGA 


GAACTTGCAT 


GGGAGGGGTA 


ACTCAGGGAT 


AGAATCCTCC 


2640 


AATAAGACTA 


AAATTAAATG 


CTAACGTOTT 


AAAGAAAAAC 


TTATTATGGG 


TTCCCGCCAA 


2700 


TCAACACCCT 


AACGTTAAGC 


CTGATAATXT 


CCTAGAGCTT 


GTACAAGATA 


CTTTACAAAA 


2760 


TATACAACTA 


AGCGACAATG 


GTGAAGATAA 


TGATGGGAAT 


AGCAATGAAA 


ATAACGATAT 


2820 


TGAGGATAAT 


GGGGAGGATA 


aagaatcaca\ 








2850 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino aci\ 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..451 

(D) OTHER INFORMATION: /note= V'The protein sequence of 
the Afclp protein from yeast presented as a polypeptide 
sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Met Phe Asp Leu Lys Thr lie Leu Asp His Pro Asn Thr Pro Trp Lys 
15 1Q 15 

Leu lie lie Ser Gly Phe Ser lie Ala Gin Phe Ser Phe Glu Ser Tyr 
20 25 \ 30 

Leu Thr Tyr Arg Gin Tyr Gin Lys Leu Ser\Glu Thr Lys Leu Pro Pro 
35 40 \ 45 

Val Leu Glu Asp Glu lie Asp Asp Glu Thr the His Lys Ser Arg Asn 
50 55 I 60 

Tyr Ser Arg Ala Lys Ala Lys Phe Ser lie Pne Gly Asp Val Tyr Asn 
65 70 75 80 



Leu Ala Gin Lys Leu Val Phe lie Lys Tyr Asp Leu Phe Pro Lys lie 
85 90 f 95 
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Trp His Met Ala Val\Ser Leu Leu Asn Ala Val Leu Pro Val Arg Phe 
100 \ 105 110 

His Met Val Ser Thr Val Ala Gin Ser Leu Cys Phe Leu Gly Leu Leu 
115 \ 120 125 

Ser Ser Leu Ser Thr Leii Val Asp Leu Pro Leu Ser Tyr Tyr Ser His 
130 \ 135 140 

Phe Val Leu Glu Glu Lys\ Phe Gly Phe Asn Lys Leu Thr Val Gin Leu 
145 150\ 155 160 

Trp lie Thr Asp Met lie Lys Ser Leu Thr Leu Ala Tyr Ala lie Gly 
165 \ 170 175 

Gly Pro lie Leu Tyr Leu pAe Leu Lys lie Phe Asp Lys Phe Pro Thr 
180 \ 185 190 

Asp Phe Leu Trp Tyr lie Med Val Phe Leu Phe Val Val Gin lie Leu 
195 \200 205 

Ala Met Thr He He Pro Val tehe He Met Pro Met Phe Met Lys Phe 
210 215 \ . 220 

Thr Pro Leu Glu Asp Gly Glu Lfeu Lys Lys Ser He Glu Ser Leu Ala 
225 230 \ 235 240 

Asp Arg Val Gly Phe Pro Leu Asb Lys He Phe Val He Asp Gly Ser 
245 \ 250 255 

Lys Arg Ser Ser His Ser Asn Ala\Tyr Phe Thr Gly Leu Pro Phe Thr 
260 \265 270 

Ser Lys Arg He Val Leu Phe Asp 'thr Leu Val Asn Ser Asn Ser Thr 
275 280 \ 285 

Asp Glu He Thr Ala Val Leu Ala Has Glu He Gly His Trp Gin Lys 
290 295 \ 300 

Met His He Val Asn Met Val He PhA Ser Gin Leu His Thr Phe Leu 
305 310 \ 315 320 

He Phe Ser Leu Phe Thr Ser He Tyr VArg Asn Thr Ser Phe Tyr Asn 
325 \?30 335 

Thr Phe Gly Phe Phe Leu Glu Lys Ser *fthr Gly Ser Ser Phe Val Asp 
340 345 \ 350 

Pro Val He Thr Lys Glu Phe Pro He lie He Gly Phe Met Leu Phe 
355 360 \ 365 

Asn Asp Leu Leu Thr Pro Leu Glu Cys Ala\ Met Gin Phe Val Met Ser 
370 375 \ 380 

Leu He Ser Arg Thr His Glu Tyr Gin Ala ksp Ala Tyr Ala Lys Lys 
385 390 J95 400 

Leu Gly Tyr Lys Gin Asn Leu Cys Arg Ala Lfeu He Asp Leu Gin He 
405 410 t 415 

Lys Asn Leu Ser Thr Met Asn Val Asp Pro Lei Tyr Ser Ser Tyr His 
420 425 \ 430 



Tyr Ser His Pro Thr Leu Ala Glu Arg Leu ThrlAla Leu Asp Tyr Val 
435 440 \ 445 
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Ser Glu Ly: 
450 



INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE:\amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 1 . . 315 

(D) OTHER INFORMATION : /note= "The sequence of the 
Rcelp protein ^rom yeast presented as a polypeptide 
sequence . " 



(xi) SEQUENCE DESCRIPTIC 

Met Leu Gin Phe Ser Thr 

1 5 

Tyr Val Leu Pro Leu Tyr 
20 

Asp Asn Pro Arg Thr lie 
35 

Leu lie Ser Asn Leu Phe 
50 



SEQ ID NO: 
?he Leu Val 



AAa Thr Ser 
25 

Lys\Ser Arg 
0 

Leu \A^1 Pro 
55 



Leu Leu 
10 

Gin Pro 
Met Gin 
Phe Leu 



Tyr 
Glu 
Lys 



Gin 
60 



lie Ser lie Ser 
15 

Gin Ser Lys Arg 
30 

Leu Thr lie Met 
45 

Ser Gin Leu Ser 



Ser Thr Thr Ser His lie Gly Phe\ Lys 
65 70 



lie lie Pro Gly Tyr Tyr 
85 



Ala Ala Leu 



Asp Ala Phe Leu Gly Leu Gly 
75 80 

Pro Asn Pro Trp Gin Phe Ser 
90 95 



Gin Phe Val Lys Asp Leu 
100 



Thr Lys C^ts 

io\ 



Val Ala Met 



Leu Leu Thr Leu 
110 



Tyr Cys Gly Pro Val Leu Asp Phe Val\ 
115 120 

Lys Ser Ser lie Leu Glu Asp Phe Tyr 
130 135 

Ser Phe Arg Asn Phe lie Phe Ala Pro 
145 150 



Leu Tyr 

[is Glu 



I3\e Thr 
155 



His 



Phe 
140 

Glu 



Leu Leu Asn Pro 
125 

Leu Asn lie Trp 



Glu lie Phe Tyr 
160 



Thr Ser Met Leu Leu Thr Thr Tyr Leu 
165 



Asn\ Leu 

170 



He 



Pro His Ser Gin 
175 



Leu Ser Tyr Gin Gin Leu 
180 

Ala His Ala His His Ala 
195 

Thr Val Ser He Leu Leu 
210 



Phe Trp Gin 
185 

Tyr Glu Gin 
200 

Thr Thr Cys 
215 



Pro Eer Leu 



Leu Cain Glu 



Phe Gin 



He 
220 



Phe Phe Gly Leu 
190 

Gly Ser Met Thr 
205 

Leu Tyr Thr ThrO 





Leu Phe Gly Gly Leu *£hr Lys Phe Val Phe Val Arg Thr Gly Gly Asn 

225 2?0 235 240 

Leu Trp Cys Cys lie lie Leu His Ala Leu Cys Asn lie Met Gly Phe 

245 1 250 255 

Pro Gly Pro Ser Phe Leli Asn Leu His Phe Thr Val Val Asp Lys Lys 

260 I 265 270 

Ala Gly Arg lie Ser Lys\ Leu Val Ser lie Trp Asn Lys Cys Tyr Phe 

275 \ 280 285 

Ala Leu Leu Val Leu Gly Leu lie Ser Leu Lys Asp Thr Leu Gly Thr 

290 235 300 



Leu Val Gly Thr Pro Gly Tvr Arg lie Thr Leu 
305 310 \ 315 
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